Here I will clean the 2019_subway_rider_data

Now I will specifically clean the data within the columns

Cleaned rider frequency table:

converting complaints into long format:

Making some of the variables as a character and some as a factor!

<<<<<<< HEAD Converting zip codes into boroughs. I used https://www.nyc.gov/assets/planning/download/office/data-maps/nyc-population/census2000/sf1p11.xls to get majority of the borough’s zip codes. Any remaining ones, I looked up on Google. There were three zip codes that I could not find a match for, and were likely miswritten, so I removed those data rows.

Create complaint_data_time from rider_data_full with time information

Column

Chart A

Column

Chart B

Chart C